|
TRE is an open-source library for pattern matching in text, which works like a regular expression engine with the ability to do approximate string matching. It is developed by Ville Laurikari and distributed under a 2-clause BSD-like license. The library is written in C and provides functions which allow using regular expressions for searching over input text lines. The main difference from other regular expression engines is that TRE can match text fragments in an approximate way, that is, supposing that text could have some number of typos. == Features == TRE uses extended regular expression syntax with the addition of "directions" for matching preceding fragment in approximate way. Each of such directions specifies how many typos are allowed for this fragment. Approximate matching is performed in a way similar to Levenshtein distance, which means that there are three types of typos 'recognized': * insertion of an extra character (''regullar experession''); * missing of a character from pattern (''reglar expession''); * replacement of some character (''regolar exprezsion''). TRE allows specifying of ''cost'' for each of three typos type independently. The project comes with a command-line utility, a reimplementation of agrep. Though approximate matching requires some syntax extension, when this feature is not used, TRE works like most of other regular expression matching engines. This means that * it implements ordinary regular expressions written for strict matching; * programmers familiar with POSIX-style regular expressions need not do much study to be able to use TRE. 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「TRE (computing)」の詳細全文を読む スポンサード リンク
|